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DESCRIPTION 

METHODS AND COMPOSITIONS FOR TAILING AND AMPLIFYING RNA 
BACKGROUND OF THE INVENTION 

5 1. Field of the Invention 

The present invention relates generally to the field of molecular biology. 
More particularly, it concerns methods and compositions for tailing and amplifying 
non-polyadenylated RNA molecules, including microRNA, siRNA, tRNA, rRNA, 
synthetic RNA, and non-polyadenylated mRNA, such as mRNA from bacteria or 
10 fragments of prokaryotic or eukaryotic mRNAs such as may be encountered in 
degraded RNA samples. 

2. Description of Related Art 

The commonly used method for mRNA purification from eukaryotic cells, 
oligo-dT capture, is ineffective for capturing RNA molecules lacking poly(A) tails. 
15 Examples of RNA molecules that are not polyadenylated include microRNA, siRNA, 
tRNA, rRNA, synthetic RNA, and non-polyadenylated mRNA from prokaryotes. In 
addition, mRNA fragments, such as may be encountered in degraded RNA samples, 
may also lack poly(A) tails. The study of these important RNA molecules is hindered 
by inadequate methods for their isolation and amplification. 

20 For example, small RNAs, such as microRNA (miRNA) and small interfering 

RNA (siRNA), have emerged as powerful post-transcriptional regulators of gene 
expression in many different organisms, thus making the analysis of small RNA 
molecules increasingly important. To generate scientifically accurate data, the 
analysis of small RNA molecules requires their quantitative purification and 

25 amplification. There is a need for novel methods and compositions that are capable of 
quantitatively purifying and amplifying these important molecules. 

Another example of the difficulties encountered when analyzing non- 
polydenylated RNA molecules is the study of bacterial gene expression. The ability 
to evaluate global gene expression responses of bacterial pathogens during host cell 
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infections is seen as crucial for identifying novel bacterial targets for therapeutic 
intervention and for a comprehensive understanding of bacterial pathogenesis. 
However, methods for in vivo whole genome expression analyses have not been fully 
realized. Genome-wide analysis of bacterial gene expression, following growth in the 

5 presence of host cells in vitro or in vivo, has been referred to as the "Holy Grail" of 
pathogen expression analysis (Sassetti and Rubin, 2002). Others have more recently 
reiterated that this remains the "ultimate goal" of pathogen gene expression analysis 
(Schoolnik, 2002; Conway and Schoolnik, 2003). The technical difficulties that 
hinder these studies include (1) purifying bacterial RNA free from large amounts of 

10 contaminating host cell RNA, and (2) purifying adequate amounts of bacterial RNA 
for DNA microarray analysis. 

Bacteria respond rapidly to changes in their microenvironments. This quick 
adaptability enables them to survive and grow in all kinds of environmental 
conditions, including of course, eukaryotic host organisms. Many studies with 

15 bacterial pathogens have shown that bacteria exhibit altered gene/protein expression 
during interactions with host cells. In the past few years, host-induced changes in 
bacterial gene expression have been identified for Neisseria meningitidis (Taha et al, 
1998; Grifantini et al, 2002; Dietrich et al, 2003), Mycobacterium tuberculosis 
(Triccas et al, 1999), Salmonella spp. (Eriksson et al, 2003; Valdiva and Falkow, 

20 1997), Brucella suis (Boschiroli et al, 2002), and Legionella pneumophila (Abu 
Kwaik, 1998), to name a few. Analyses of bacterial gene expression during 
interactions with host cells will provide information on (1) the specific conditions of 
the host microenvironment (e.g., within phagocytic vacuoles, (Staudinger et al, 
2002), (2) mechanisms and pathways used by bacteria in response to those 

25 microenvironments, (3) proteins that are essential for survival and growth in vivo, (4) 
novel vaccine candidates, and (5) novel targets for antibiotics. 

Identifying bacterial genes whose expression is altered by host cells has not 
been easy, and microbiologists have used numerous techniques to address the 
challenge. These include mRNA-subtracted cDNA libraries (Scott-Craig et al, 1991; 
30 Plum and Clark-Curtiss, 1994), in vivo expression technology (IVET, (Mahan et al, 
1993), signature tagged mutagenesis (Hensel et al, 1995), gene fusions with lacZ 
(Taha et al, 1998) and gfp (Triccas et al, 1999; Valdiva and Falkow, 1997; 
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Boschiroli et aL, 2002), differential display PCR (Abu Kwaik and Pederson, 1996), 
one- and two-dimensional electrophoresis of cell proteins (Abu Kwaik, 1998; Abshire 
and Neidhardt, 1993; Monahan et aL, 2001), and cDNA selection (SCOTS-selective 
capture of transcribed sequences, (Graham and Clark-Curtiss, 1999). Many of these 

5 techniques have proven invaluable for the initial identification of specific bacterial 
genes whose expression is altered by interaction with host cells. However, all of these 
methods have limitations. Most require significant genetic manipulation (construction 
of mutants, libraries, gene fusions) and are labor intensive. Chief amongst the 
limitations is that none allows for quantification of genome-wide expression. Two- 

10 dimensional protein electrophoresis comes closest to allowing global expression 
analyses. However, at this point, it too is limited by resolution and sensitivity. 

A handful of research studies have appeared in which investigators have 
overcome the technical difficulties described above. Investigators at Chiron and The 
Institute For Genomic Research (TIGR) used microarrays to analyze gene expression 

1 5 of Neisseria meningitidis following adherence to cultured epithelial cells (Grifantini et 
aL, 2002). To reduce contaminating host-cell RNA, the epithelial cells were 
selectively lysed with saponin (a cholesterol-binding detergent), bacterial cells were 
harvested, and RNA was purified. Another group used SDS to selectively lyse 
macrophages containing phagocytized Salmonella enterica (Eriksson et aL, 2003). 

20 RNA was then purified from the harvested bacteria. Staudinger et aL (2002) 
harvested total RNA from neutrophils with internalized E. colL The authors reported 
that bacterial RNA was estimated to be 1/5 or more of the RNA isolated from the 
sample. Staudinger et aL used gene-specific primers for cDNA synthesis prior to E. 
coli array analysis. 

25 Selective eukaryotic cell lysis followed by harvesting of bacteria is a useful 

method for increasing the relative amount of bacterial RNA isolated from microbe- 
host cell mixtures. It is imperative that RNA be stabilized prior to such treatments. If 
it is not, detergent treatment will likely alter the gene expression profile of the 
bacteria. In addition, selective detergent lysis of host cells is less effective with many 

30 Gram-negative bacteria that are easily lysed with such detergents (e.g., Yersinia 
enter ocoliticd). Furthermore, variable susceptibility of eukaryotic cells to saponin- 
mediated lysis makes optimization difficult. Selective eukaryotic cell lysis may be 
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least useful with infected tissue samples that require vigorous homogenization to 
effect cell lysis. Nonetheless, selective lysis of host cells with detergents is a good 
idea that can be helpful for reducing excessive amounts of host cell RNA. 

The use of gene-specific primers with bacterial arrays is another method 
5 (Staudinger et aL 9 above) that has been used in hope of specifically priming bacterial 
RNA's. But, this results in the loss of -30% of hybridization signals on E. coli arrays 
(Khil and Camerini-Otero, 2002; Arfin et al 9 2000). 

A small percentage of bacterial mRNAs are poly(A)-tailed, but these are 
targeted for degradation and tend to be unstable. As a result, the commonly used 
method for mRNA purification with eukaryotic cells, oligo-dT capture, is ineffective 
for capturing bacterial mRNAs. Methods to polyadenylate bacterial mRNAs, thereby 
allowing for their purification by oligo dT-capture have been developed. Amara and 
Vijaya (1997) demonstrated that mRNAs in purified polysomes can be specifically 
polyadenylated and purified by oligo-dT capture. However, using their method 
Amara and Vijaya were not able to efficiently poly(A) tail the transcripts. For 
example, they reported that only 50% of the NS3 messages were polyadenylated. 
Wendisch et al (2001) showed that the same process can be carried out with crude 
cell extracts. 

As suggested above, several shortcomings are associated with the 
polyadenylation approach described in the prior art. Different mRNAs may be 
polyadenylated to different extents or not at all depending on the structure of their 5' 
and 3' ends (Feng et al, 2000). Polyadenylation in a cell lysate, followed by 
purification of RNA, will require inactivation of cellular RNAses so that transcripts 
are not degraded during the polyadenylation reaction. Optimizing the reaction to 
work reproducibly in many different bacterial cell lysates would likely be very 
difficult. 

If methods of polyadenylating bacterial mRNAs are to be used in genome- 
wide expression analysis, it is important that as much as possible, if not all, mRNAs 
in a population of bacterial mRNA are poly(A) tailed, thereby ensuring that they will 
30 be representatively purified by oligo-dT capture and/or templates for the 1 st strand 
cDNA synthesis reaction. However, using the standard reaction conditions for 
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commercially available PAP enzymes and synthetic RNA transcripts, the inventors 
found that only about 90-95% of a transcript could be tailed. 

Furthermore, it was also observed that with small mass amounts of RNA, such 
as those likely to be used for amplification (<100 ng), the tail lengths were extremely 

5 long - up to 9 kb in length. The inventors surmise that excessive tail-length could 
inhibit amplification reactions through several possible mechanisms: (1) reverse 
transcriptase may dissociate from the template during polymerization through 
extremely long homopolymeric A tracts; (2) dTTP may be effectively exhausted 
during 1 st strand synthesis, slowing the reaction rate; (3) excess poly(A) may itself be 

10 detrimental to reaction kinetics; (4) UTP may become limiting during the in vitro 
transcription step; and/or (5) T7 polymerase may be hindered while incorporating 
long U tracts at the 5' ends of antisense RNAs (aRNAs). , 

SUMMARY OF THE INVENTION 

15 The present invention provides methods and compositions for tailing and 

amplifying a targeted RNA molecule. A targeted RNA molecule may be any non- 
polyadenylated RNA molecule. Examples of non-polyadenylated RNA include 
microRNA, siRNA, tRNA, rRNA, synthetic RNA, or non-polyadenylated mRNA, 
such as mRNA from bacteria or fragments of prokaryotic or eukaryotic mRNAs such 

20 as may be encountered in degraded RNA samples. Advantages of the present 
invention include the ability to efficiently and uniformly tail a targeted RNA and the 
ability to representatively amplify a population of targeted RNA molecules in a 
sample. 

In a particular aspect, the present invention enables genome-wide expression 
25 analysis of bacterial genes. Significantly, the methods and compositions enable 
genome-wide expression analysis in circumstances where bacterial numbers were 
previously too low to purify adequate amounts of RNA for DNA microarray analysis 
or other applications. Such methods and compositions would be particularly useful 
for the study of bacterial gene expression during host-cell infection. 
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In one embodiment, the invention provides a method for increasing the 
efficiency of tailing a targeted RNA in a sample comprising altering the secondary 
structure of the targeted RNA, and incubating the targeted RNA in the presence of an 
enzyme that adds a nucleic acid tail under conditions that allow tailing of the targeted 
RNA. As used herein, "increasing the efficiency of tailing" refers to increasing the 
percentage of targeted RNA molecules that are tailed during the reaction. In preferred 
embodiments, at least about 75%, 80%, 85%, or 90% of the targeted RNA molecules 
in a sample are tailed. More preferably, at least about 95% of the targeted RNA 
molecules are tailed. Even more preferably, at least about 99% of the targeted RNA 
molecules are tailed. In certain embodiments, about 95%, 96%, 97%, 98%, 99%, or 
about 100% of the targeted RNA molecules are tailed. 

In another embodiment, the invention provides a method for limiting the 
length of the tail added to a targeted RNA. The method for limiting the length of the 
tail added to a targeted RNA comprises one or both of adding a reduced amount of 
nucleotide or performing the tailing in the presence of substantially no Mn . 
Preferably, the majority (i.e., greater that 50%) of tails added to the targeted RNA 
molecules is between about 10 and about 1000 bases in length. While at least the 
majority of tails added to the targeted RNA will be between about 10 and about 1000 
bases in length, some of the tails added to the targeted RNA may be shorter or longer. 
In some preferred embodiments, 60% or more, 70% or more, 80% or more, 90% or 
more, or 95% or more of the tails added to the targeted RNA molecules are between 
about 10 and about 1000 bases in length. Even more preferably, the majority of tails 
added to the targeted RNA molecules is between about 20 and about 500 bases in 
length. In some preferred embodiments, 60% or more, 70% or more, 80% or more, 
90% or more, or 95% or more of the tails added to the targeted RNA molecules are 
between about 20 and about 500 bases in length. The majority of tails added to the 
targeted RNA molecules may be about 20, 30, 40, 50, 60, 70, 80, 90, 100, 125, 150, 
175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, or 500 bases in 
length or any range derivable therein. In certain embodiments, the majority of tails 
added to the targeted RNA molecules is less than about 500 bases in length. In some 
preferred embodiments, 60% or more, 70% or more, 80% or more, 90% or more, or 
95% or more of the tails added to the targeted RNA molecules are less than about 500 
bases in length. 
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In certain embodiments, the invention provides a method for an optimized 
tailing reaction. An "optimized tailing reaction" is a reaction that tails about 99% or 
more of the non-polyadenylated RNA in a sample, and that limits the length of the 
majority of tails added to the targeted RNA to no more than 500 bases. An optimized 
5 tailing reaction comprises altering the secondary structure of the RNAs, limiting the 
nucleotide concentration in the tailing reaction, using a buffer containing substantially 
no Mn 2+ , and limiting the tailing reaction time to 5-15 minutes at 37°C. 

The amount of nucleotide in the optimized tailing reaction will depend on how 
much RNA is present in the sample. For example, in samples with RNA amounts in 

10 the range of 1-100 ng, the nucleotide preferably is added to a concentration of 
between about 5 - 350 uM. More preferably, the nucleotide is added to a 
concentration of between about 10-100 uM. Even more preferably, the nucleotide is 
added to a concentration of between about 25 - 70 uM. For samples with greater 
amounts of RNA, more nucleotide will be needed. For samples with smaller amounts 

1 5 of RNA, less nucleotide will be needed. 

In certain aspects of the invention, altering the secondary structure of the 
targeted RNA comprises denaturing the targeted RNA. Denaturing the targeted RNA 
may be accomplished by, for example, heating the targeted RNA. In particular 
embodiments, the targeted RNA is heated for at least about 10 minutes at a 
20 temperature of at least about 70°C. In other aspects of the invention, altering the 
secondary structure of the targeted RNA comprises adding a single strand binding 
protein. In a particular aspect, the single strand binding protein is T4 Gene 32 protein. 

The target RNA that is tailed and/or amplified by the methods described 
herein may be any non-polyadenylated RNA, such as microRNA, siRNA, tRNA, 
25 rRNA, synthetic RNA, non-polyadenylated mRNA, or degraded mRNA. 

As used herein, "tailing" or "tagging" a targeted RNA molecule with a nucleic 
acid tail means covalently binding a nucleic acid sequence to the targeted RNA 
molecule. In preferred embodiments, the nucleic acid sequence is covalently bound to 
the targeted RNA molecule enzymatically. The nucleic acid sequence tail may be 
30 added to an end of the targeted RNA molecule. In a specific embodiment, the nucleic 
acid tail is added to the 3' end of the targeted RNA molecule. 
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The nucleic acid tail of the present invention may comprise any nucleic acid 
sequence. The nucleic acid sequence may comprise adenine, guanine, thymine, 
cytosine, uracil, or analogs thereof. In certain aspects, the targeted RNA molecule 
may be tailed with more than one tail. For example, an RNA molecule may be tailed 
5 first with a homopolymeric sequence, such as poly(A), and then tailed with at least a 
second homopolymeric sequence, such as poly(U). In another example, an RNA 
molecule may be tailed with a first sequence, such as a T3 RNA polymerase 
recognition sequence, and then tailed with at least a second sequence, such as a 
poly(A) sequence. 

1 0 In certain aspects, the nucleic acid sequence tail is a homopolymeric sequence, 

such as a poly(A), poly(T), poly(G), poly(C), or poly(U) sequence. The 
homopolymeric tail may be of any length, so long as it is capable of hybridizing to a 
complementary oligonucleotide primer for the initiation of cDNA synthesis. 
Preferably, the majority of homopolymeric tags is between about 10 and about 1000 

1 5 bases in length, where a majority is 50% or more, 60% or more, 70% or more, 80% or 
more, 90% or more, or 95% or more of the tails. Even more preferably, the majority 
of homopolymeric tails is between about 20 and about 500 bases in length. The 
majority of homopolymeric tails may be about 20, 30, 40, 50, 60, 70, 80, 90, 100, 125, 
150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, or 500 bases in 

20 length or any range derivable therein, hi certain embodiments, the majority of 
homopolymeric tails is less than 500 bases in length. 

In some embodiments, the homopolymeric tail is a poly(A) tail. The poly(A) 
tail may be of any length, so long as the tail is capable of hybridizing to a 
complementary oligonucleotide primer for the initiation of cDNA synthesis. In 

25 certain aspects, the majority of poly(A) tails is between about 10 and about 1000 
adenines in length, where a majority is 50% or more, 60% or more, 70% or more, 
80% or more, 90% or more, or 95% or more of the tails. In some aspects, the 
majority of poly(A) tails is between about 20 and about 500 adenines in length. The 
majority of poly(A) tails may be about 20, 30, 40, 50, 60, 70, 80, 90, 100, 125, 150, 

30 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, or 500 bases in 
length or any range derivable therein. In certain embodiments, the majority of 
poly(A) tails is less than 500 bases in length. 
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In preferred embodiments, at least about 90% of the targeted RNA molecules 
in a sample are tailed. More preferably, at least about 95% of the targeted RNA 
molecules are tailed. Even more preferably, at least about 99% of the targeted RNA 
molecules are tailed. In certain embodiments, about 95%, 96%, 97%, 98%, 99%, or 
5 about 1 00% of the targeted RNA molecules are tailed. 

In one embodiment, the method for limiting the length of the tail added to a 
targeted RNA comprises adding a reduced amount of a nucleotide to the tailing 
reaction. 

A "reduced amount of a nucleotide" as used herein refers to a nucleotide 

10 concentration of 350 uM or lower for RNA amounts in the range of from about 1 ng 
to about 100 ng. m certain embodiments, the nucleotide is added to a concentration 
of between about 10-100 uM for RNA amounts in the range of from about 1 ng to 
about 100 ng. More preferably, the nucleotide is added to a concentration of between 
about 25 - 70 uM for RNA amounts in the range of from about 1 ng to about 100 ng. 

1 5 In certain aspects of the invention, the nucleotide is added to a concentration of about 
25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 45, 46, 47, 48, 
49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, or 70 
uM for RNA amounts in the range of from about 1 ng to about 100 ng. As will be 
understood by those of ordinary skill in the art, the nucleotide concentration will vary 

20 depending on the amount of RNA in the sample. Nucleotide concentrations between 
about 25 - 70 uM are preferred for between about 1 -100 ng of RNA. For samples 
with RNA in amounts greater than 100 ng, more nucleotide may be used. Likewise, 
for samples with target RNA in amounts less than 1 ng, less nucleotide may be used. 
Based on these teachings, those of ordinary skill in the art will be able to determine 

25 the appropriate concentration of nucleotide to be used for a given sample. 

In embodiments where the nucleic acid tail being added to the targeted RNA is 
a poly(A) tail, the method may further comprise adding ATP. The amount of ATP 
added will depend on how much target RNA is present in the sample. In one 
embodiment, the method comprises adding a reduced amount of ATP to the tailing 
30 reaction. A "reduced amount of ATP" as used herein refers to ATP added to a 
concentration of between about 5 - 350 uM for RNA amounts between about 1-100 
ng. More preferably, the ATP is added to a concentration of between about 10-100 
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15 



pM for RNA amounts between about 1-100 ng. Even more preferably, the ATP is 
added to a concentration of between about 25 - 70 uM for RNA amounts between 
about 1-100 ng. In certain aspects of the invention, the ATP is added to a 
concentration of about 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 

5 42, 43, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 
66, 67, 68, 69, or 70 uM for RNA amounts between about 1-100 ng. For samples 
with target RNA in amounts greater than 100 ng, more ATP may be used. Likewise, 
for samples with target RNA in amounts less than 1 ng, less ATP may be used. Based 
on these teachings, those of ordinary skill in the art will be able to determine the 

10 appropriate concentration of ATP to be used for a given sample. 

Standard buffers useful for tailing a nucleic acid with a homopolymeric 
sequence are described in, for example, Eun et al, 1996; and Cao and Sarkar, 1997, 
both of which are incorporated by reference. The inventors have developed a tailing 
reaction that limits the length of the tail added to the targeted RNA by employing an 
"optimized tailing buffer." As used herein, an "optimized tailing buffer" refers to a 
tailing buffer having substantially no Mn 2+ . In one embodiment, the "optimized 
tailing buffer" at lx concentration contains: 50mM Tris-HCl, pH 8.3; 75 mM KC1; 3 
mM MgCl 2 ; and 10 mM DTT. 

The "tailing enzyme" of the present invention may be any enzyme capable of 
adding a nucleic acid tail to a targeted RNA molecule, hi preferred embodiments the 
tailing enzyme is an enzyme capable of synthesizing a homopolymeric tail. Those of 
ordinary skill in the art are familiar with various enzymes capable of synthesizing 
homopolymeric tails. For example, poly(A) polymerases such as poly(A) polymerase 
I of E. coli or yeast poly(A) polymerase may be used. Poly(A)polymerases from 
mammalian or viral sources may also be used. Recombinant poly(A)polymerase 
enzymes may be used in the methods and compositions of the present invention. 
Poly(A) polymerase enzymes are described in, for example, Yehudai-Resheff, S. et al. 
(2000); Mohanty and Kushner (1999); Mohanty and Kushner (2000); Cao and Sarkar 
(1997); Raynal et al. (1996), all of which are incorporated by reference. 

Polynucleotide phosphorylase enzymes could also be used for adding a 
homopolymeric tail to a targeted RNA molecule. The polyadenylation of RNA using 
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polynucleotide phosphorylase is described in, for example, Engel and Davidson 
(1978), incorporated by reference. 

In certain embodiments of the invention, a nucleotidyl transferase may be used 
to tail a targeted RNA. Poly(A) polymerase enzymes have sequence similarity with 
5 enzymes of the nucleotidyl transferase family. Studies have indicated that some 
members of the nucleotidyl transferase family have poly(A) polymerase activity 
(Read and Norbury (2002), incorporated by reference.) 

It is specifically contemplated that in some embodiments the tailing enzyme 
employed to tail the targeted RNA molecule is not a ligase ("non-ligase enzyme"). 

1 0 An RNA sample may be any sample that comprises a targeted RNA molecule. 

The RNA sample may be obtained from a cell, cell culture, a body fluid, a tissue, or 
an organ. In certain embodiments, the sample is a fixed sample or a frozen sample, 
such as a fixed tissue or frozen tissue sample. The sample may be an environmental 
sample. Examples of environmental samples include soil samples, water samples, and 

15 air samples. 

In certain embodiments, the RNA sample comprises a mixture of eukaryotic 
RNA and bacterial RNA. In some embodiments, the RNA sample comprises RNA 
from more than one bacterial species. 

hi some embodiments, the methods involve adding a single strand binding 
20 protein. Single strand binding proteins are proteins that specifically bind to single 
stranded nucleic acid molecules. Single strand binding proteins can stabilize single- 
strand regions of DNA or RNA. Numerous single strand binding proteins are known 
in the art. In one embodiment, the single strand binding protein is T4 Gene 32 
protein. In another embodiment, the single strand binding protein is SSB from E. coli. 
25 Both T4 Gene 32 protein and SSB are available commercially from USB Corp. 

Certain embodiments of the invention involve volume excluding reagents. In 
one embodiment, a volume excluding reagent is included in the reaction when tailing 
a targeted RNA molecule. Volume excluding reagents include, for example 
polyethylene glycol (PEG) and dextran. 
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In certain embodiments of the invention, the method further comprises 
depleting DNA from the sample. Methods of depleting DNA or separating DNA 
from RNA are well known to those skilled in the art. One common approach for 
depleting DNA is to incubate the sample with DNase. Another approach is an acid 
5 phenolxhloroform extraction. Acid phenol: chloroform (e.g., 5:1 phenol:CHCl 3 ; pH 
4.7) extraction partitions DNA in to the organic phase. The RNA remains in the 
aqueous phase and can be subsequently recovered by precipitation. Yet another 
method for separating DNA from RNA is lithium chloride precipitation. LiCl 
precipitation selectively precipitates RNA. It inefficiently precipitates DNA, which is 
10 discarded in the supernatant. Filter based RNA isolation systems, such as 
RNAqueous® (Ambion, Austin, TX), are also known in the art. 

In some embodiments of the invention, the method further comprises 
depleting polyadenylated mRNA from a sample prior to tailing the targeted RNA. 
Methods for specifically isolating polyadenylated mRNA from a sample are well 

15 known to those of ordinary skill in the art. For example, a common method for 
isolating polyadenylated mRNA comprises hybridizing the polyadenylated mRNA to 
a poly(T) oligonucleotide. Typically, the poly(T) oligonucleotide is attached to a 
surface, such as a column or a bead. After the polyadenylated mRNA is hybridized to 
the poly(T) oligonucleotide, it can be separated from the sample. For example, if the 

20 polyadenylated mRNA is hybridized to the poly(T) oligonucleotide immobilized on a 
magnetic bead. The beads may then be separated from the sample using a magnet. 

In other aspects of the invention, the method further comprises depleting 
rRNA from the sample. Depending on the composition of the sample, it may be 
desirable to deplete eukaryotic rRNA, bacterial rRNA, or both. For example, 

25 eukaryotic rRNA may hybridized with one or more oligonucleotides complementary 
to at least a portion of one or more of the 17S rRNA, 18S rRNA, or 28S rRNA. 
Similarly, bacterial rRNA may be hybridized with one or more oligonucleotides 
complementary to at least a portion of one or more of the 16S rRNA or 23S rRNA. 
The hybridization complexes are then removed from the sample with an appropriate 

30 capture system. Typically, the oligonucleotides are immobilized on a surface, which 
enables the removal of the hybridization complexes. MICROBExpress™ and 
MICROBEnrich™ (Ambion, Austin, TX) are examples of commercially available 



-12- 



WO 2006/029350 



PCT/US2005/032227 



kits for the depletion of rRNA. Methods and compositions for the depletion or rRNA 
from a sample are described in U.S. Application Serial No. 10/029,397, which is 
incorporated by reference. 

Depleting rRNA from the sample can enable the relative enrichment of 
5 targeted RNA to total RNA in a sample by about or at least about 5, 10, 20, 30, 40, 
50, 100, 200, 300, 400, 500, 600, 700, 800, 900, or 1000 fold or any range derivable 
therein. 

In certain aspects, the methods and compositions of the present invention 
enables the enrichments of targeted RNA in a sample such that about or at least about 
10 50%, 60%, 70%, 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 
100%, or any range derivable therein, of the RNA is targeted RNA. 

In additional aspects of the invention, the method comprises amplifying the 
targeted RNA after it is tailed. In certain aspects, amplifying the tailed RNA 
comprises: hybridizing the tailed RNA with a promoter-oligo-dN primer, where N is 

15 a nucleotide complementary to the tail on the targeted RNA; extending the promoter- 
oligo-dN primer using a reverse transcriptase to form a first strand DNA 
complementary to the tailed RNA; synthesizing a second strand DNA complementary 
to the first strand; and transcribing copies of RNA initiated from the promoter-oligo- 
dN primer using an RNA polymerase, wherein the RNA is complementary to the 

20 second strand DNA. The transcribed RNA represents the anti-sense RNA strand. 

In some embodiments, the targeted RNA being amplified is poly(A)-tailed. In 
certain aspects, amplifying the poly(A)-tailed RNA comprises: hybridizing the 
poly(A)-tailed RNA with a promoter-oligo-dT primer; extending the promoter-oligo- 
dT primer using a reverse transcriptase to form a first strand DNA complementary to 
25 the poly(A)-tailed RNA; synthesizing a second strand DNA complementary to the 
first strand; and transcribing copies of RNA initiated from the promoter-oligo-dT 
primer using an RNA polymerase, wherein the RNA is complementary to the second 
strand DNA. The transcribed RNA represents the anti-sense RNA strand. 

In some embodiments, the methods and compositions enable at least a 500, 
30 750, 1,000, 1,250, 1,500, 1,750, 2,000, 2,250, 2,500, 2,750, 3,000, 4,000, 5,000, 
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6,000, 7,000, 8,000, 9,000, 10,000, or 20,000 fold or more amplification of the 
targeted RNA. 

The RNA polymerase may be, for example, a T bacteriophage RNA 
polymerase or an SP6 RNA polymerase. Preferably, the T bacteriophage RNA 
5 polymerase is T7 RNA polymerase or T3 RNA polymerase. 

In some preferred embodiments the promoter-oligo-dT primer is a T7-oligo- 
dT, T7-oligo~dTV, or T7-oligo-dTVN. In some embodiments, the T7-oligo-dT is T7- 
dT 2 4VN. In one preferred embodiment the T7-oligo-dT sequence is 5'- 
GGTAATACGACTCACTATAGGGAGAAGAG(T) 24 VN-3' (SEQ ID NO. 1). 

It will be obvious to those of skill in the art that any reverse transcriptase may 
be used in the present invention. In preferred embodiments, the reverse transcriptase 
is Moloney murine leukemia virus (MMLV) reverse transcriptase or avian 
myeloblastosis virus (AMV) reverse transcriptase. The reverse transcriptase may be a 
mutant reverse transcriptase, as long as the mutants retain cDNA synthesizing 
activity. Examples of reverse transcriptase mutants include those with reduced or 
absent RnaseH activity (e.g., Superscript™ II, Superscript™ III, and ThermoScript™ 
(Invitrogen)) and those with enhanced activity at higher temperatures (Superscript™ 
III and ThermoScript™ (Invitrogen)). In one preferred embodiment the reverse 
transcriptase is Arrayscript™ (Ambion), which is a mutant MMLV with reduced 
RnaseH activity. 

In some embodiments, the invention comprises using a targeted RNA for 
expression analysis. Methods of gene expression analysis are well known in the art. 
In preferred embodiments, the expression analysis comprises hybridizing the targeted 
RNA to a microarray. The expression analysis may be performed on either amplified 
25 or unamplified, targeted RNA. In a preferred embodiment, amplified RNA is used for 
expression analysis. 

In certain aspects of the invention, the targeted RNA and/or amplified RNA 
are labeled. Labeling the targeted RNA and/or amplified RNA facilitates the 
detection of the molecules in applications such as expression analysis. A number of 
30 different labels may be used in the present invention such as fluorophores, 
chromophores, radiophores, enzymatic tags, antibodies, chemiluminescence, 
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electroluminescence, and affinity labels. Those of skill in the art are familiar with 
methods for labeling nucleic acids and will recognize that these and other labels not 
mentioned herein can be used with success in this invention. 

Examples of affinity labels include, but are not limited to the following: an 
5 antibody, an antibody fragment, a receptor protein, a hormone, biotin, DNP, or any 
polypeptide/protein molecule that binds to an affinity label. 

Examples of enzyme tags include enzymes such as urease, alkaline 
phosphatase or peroxidase to mention a few. Colorimetric indicator substrates can be 
employed to provide a detection means visible to the human eye or 
10 spectrophotometrically, to identify specific hybridization with complementary nucleic 
acid-containing samples. All of these examples are generally known in the art and the 
skilled artisan will recognize that the invention is not limited to the examples 
described above. 

Examples of fluorophores include, but are not limited to the following: Alexa 
350, Alexa 430, AMCA, BODIPY 630/650, BODIPY 650/665, BODIPY-FL, 
BODIPY-R6G, BODJPY-TMR, BODIPY-TRX, Cascade Blue, Cy2, Cy3, Cy 3.5, 
Cy5, Cy5.5, Cy7, 6-FAM, Fluoroscein, HEX, 6-JOE, Oregon Green 488, Oregon 
Green 500, Oregon Green 514, Pacific Blue, REG, Rhodamine Green, Rhodamine 
Red, ROX, TAMRA, TET, Tetramethylrhodamine, lissamine, phycoerythrin, FluorX, 
and Texas Red. 

In some embodiments, the invention provides a method for amplifying 
bacterial mRNA from a sample comprising bacterial cells of a single species or 
bacterial cells of multiple species comprising: isolating RNA from the sample; tailing 
the bacterial mRNA; and amplifying the bacterial mRNA. 

In other embodiments, the invention provides a method for amplifying 
bacterial mRNA from a sample comprising bacterial cells of a single species or 
bacterial cells of multiple species and eukaryotic cells comprising: isolating RNA 
from the sample; depleting polyadenylated mRNA from the isolated RNA; depleting 
rRNA from the isolated RNA; tailing the bacterial mRNA; and amplifying the 
bacterial mRNA. 
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In some embodiments, the invention provides a method for amplifying 
bacterial mRNA from a sample comprising bacterial cells and eukaryotic cells 
comprising: isolating RNA from the sample; depleting polyadenylated mRNA from 
the isolated RNA; depleting rRNA from the isolated RNA; tailing the bacterial 
5 mRNA; and amplifying the bacterial mRNA. 

In other embodiments, the invention provides a method for analyzing bacterial 
mRNA expression in a sample comprising bacterial RNA and eukaryotic RNA 
comprising: isolating the bacterial RNA and the eukaryotic RNA from the sample to 
form an RNA composition; depleting polyadenylated mRNA and rRNA from the 
10 RNA composition resulting in a relative enrichment of bacterial mRNA in the RNA 
composition; polyadenylating the bacterial mRNA; and amplifying the bacterial 
mRNA, wherein the amplified bacterial mRNA is used for expression analysis. 

In certain embodiments, the method comprises stabilizing RNA in the sample. 
Any method of stabilizing RNA in a sample that is known in the art may be used in 
15 the practice of the present invention. Commercially available products for stabilizing 
RNA include RNA/afer® from Ambion (Austin, TX) and RNAprotect™ Bacteria 
Reagent from Qiageh (Valencia, CA). Examples of other methods for stabilizing 
RNA include phenoLEtOH mixtures (Eriksson et al, 2003), incorporated by 
reference), and acetone/EtOH mixtures. 

20 In some embodiments, the present invention provides an improvement in a 

method for polyadenylating bacterial mRNA, wherein the improvement comprises 
altering the secondary structure of the mRNA prior to incubating the mRNA with a 
poly(A) polymerase and polyadenylating the mRNA in the presence of substantially 
no Mn 2+ . Altering the secondary structure of the mRNA may comprise, for example, 

25 denaturing the mRNA or adding a single strand binding protein. 

Other aspects of the invention provide kits. In certain embodiments, the kit, in 
suitable container means, comprises: a poly(A) polymerase; ATP; an optimized 
tailing buffer; and a control RNA. 

In certain embodiments, the kit further comprises one or more of: an RNA 
30 polymerase; a single strand binding protein; ethylenediaminetetraacetic acid (EDTA); 
a reverse transcriptase; a first strand buffer; a promoter-oligo-dT primer; a dNTP mix; 
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a ribonuclease inhibitor; a second strand buffer; a DNA polymerase; RNase H; 
nuclease free water; an RNA polymerase reaction buffer; ATP; CTP; GTP; UTP; 
TTP; DNase I; a cDNA binding buffer; a cDNA wash buffer; an aRNA binding 
buffer; an aRNA wash buffer; an elution solution; an aRNA filter cartridge; a cDNA 
5 filter cartridge; or collection tubes. 

In some embodiments, the poly(A) polymerase is poly(A) polymerase I of E. 
coli or yeast poly(A) polymerase. 

In certain embodiments, the promoter-oligo-dT primer promotes transcription 
by a T bacteriophage RNA polymerase, such as T7 RNA polymerase or T3 RNA 
10 polymerase. In other embodiments, the promoter-oligo-dT primer is an SP6-oligo-dT 
primer. In a preferred embodiment the promoter-oligo-dT primer is a T7-oligo-dT 
primer. In certain embodiments the T7-oligo-dT primer is T7-dT 2 4VN. 

The RNA polymerase may be, for example, T7 RNA polymerase, T3 RNA 
polymerases SP6 RNA polymerase. 

15 Non-limiting examples of single strand binding proteins include T4 Gene 32 

and E. coli SSB. A non-limiting example of a ribonuclease inhibitor is placental 
ribonuclease inhibitor. Examples of DNA polymerases include E. coli DNA 
polymerase and T4 DNA polymerase. 

It is contemplated that any method or composition described herein can be 
20 implemented with respect to any other method or composition described herein. 

The use of the term "or" in the claims is used to mean " and/or" unless 
explicitly indicated to refer to alternatives only or the alternatives are mutually 
exclusive, although the disclosure supports a definition that refers to only alternatives 
and "and/or." 

25 Throughout this application, the term "about" is used to indicate that a value 

includes the standard deviation of error for the device or method being employed to 
determine the value. 
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Following long-standing patent law, the words "a" and "an," when used in 
conjunction with the word "comprising" in the claims or specification, denotes one or 
more, unless specifically noted. 

Other objects, features and advantages of the present invention will become 
5 apparent from the following detailed description. It should be understood, however, 
that the detailed description and the specific examples, while indicating specific 
embodiments of the invention, are given by way of illustration only, since various 
changes and modifications within the spirit and scope of the invention will become 
apparent to those skilled in the art from this detailed description. 

10 BRIEF DESCRIPTION OF THE DRAWINGS 

The following drawings form part of the present specification and are included 
to further demonstrate certain aspects of the present invention. The invention may be 
better understood by reference to one or more of these drawings in combination with 
the detailed description of specific embodiments presented herein. 

15 FIG. 1. FIG. 1 shows an Agilent Bioanalyzer electropherogram 

depicting the extent of polyadenylation of a synthetic GAPDH transcript that was 
polyadenylated using poly(A) polymerase I. The untailed transcript in the control 
reaction (no poly(A) polymerase I) is -315 nucleotides. The majority of the tailed 
transcript is -570 nucleotides. However, some transcripts are tailed to greater 

20 lengths. 

FIG. 2. FIG. 2 shows an Agilent Bioanalyzer electropherogram 
depicting the extent of polyadenylation for reactions with or without MnCfe, and with 
reaction times of 5, 10, or 15 minutes. When the reaction is performed with MnCl 2 at 
a concentration of 2.5 mM, tail length extends from about 0.5 Kb to beyond 6 Kb. 
25 When MnCl 2 is not added to the reaction, tail length is reduced to a median size of 
about 150 nucleotides. While incubation time has some effect on tail length, it is 
small relative to the MnCl 2 effect. 
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DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

A. THE PRESENT INVENTION 

The present invention provides methods and compositions for tailing and 
5 amplifying a targeted RNA molecule. A targeted RNA molecule may be any non- 
polyadenylated RNA. Examples of non-polyadenylated RNA include microRNA, 
siRNA, tRNA, rRNA, synthetic RNA, or non-polyadenylated mRNA, such as mRNA 
from bacteria or degraded mRNA from prokaryotic or eukaryotic organisms. 
Advantages of the present invention include the ability to efficiently and uniformly 
10 tail a targeted RNA and the ability to representatively amplify a population of targeted 
RNA molecules in a sample. The ability to representatively amplify a population of 
targeted RNA molecules in a sample provides a significant advantage over the prior 
art in that it enables the study of samples containing RNA molecules in mass amounts 
previously too low to be analyzed. 

15 in one aspect, the present invention provides methods and compositions that 

enable genome-wide expression analysis of bacterial genes. Significantly, the 
methods enable genome-wide expression analysis in circumstances where bacterial 
numbers were previously too low to purify adequate amounts of RNA for DNA 
microarray analysis or other applications. Such methods are particularly useful for 

20 the study of bacterial gene expression during host-cell infection, and for the study of 
gene expression in bacteria present in environmental samples. 

In one embodiment, the present invention provides a method for genome-wide 
expression analysis that employs an "optimized" polyadenylation reaction followed 
by first strand cDNA synthesis initiated from a T7-oligo-dT primer, and then 
25 transcription of aRNA using an Eberwine type amplification reaction with a mutant 
MMLV RT. The aRNA is then used for microarray analysis. 

An "optimized" polyadenylation reaction is a reaction that polyadenylates 
about 99% or more of the non-polyadenylated RNA in a sample, and that limits the 
length of the majority of poly(A) tags to no more than 500 bases. An "optimized" 
30 polyadenylation reaction comprises denaturing the RNAs, limiting the ATP 
concentration in the poly(A) polymerase reaction, using a buffer lacking Mn 2+ , and 
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limiting the poly(A) polymerase reaction time to 5-15 minutes at 37°C. In some 
embodiments, the "optimized" polyadenylation reaction may comprise adding a single 
strand binding protein to the reaction. 

The amount of ATP in the "optimized" polyadenylation reaction will depend 
5 on how much RNA is present in the sample. For example, in samples with RNA 
amounts in the range of 1-100 ng, the ATP preferably is added to a concentration of 
between about 5 - 350 uM. More preferably, the ATP is added to a concentration of 
between about 10-100 uM. Even more preferably, the ATP is added to a 
concentration of between about 25 - 70 uM. For samples with greater amounts of 
10 RNA, more ATP will be needed. For samples with smaller amounts of RNA, less 
ATP will be needed. 

In other embodiments, the present invention provides kits. In a preferred 
embodiment, the kits comprise, in suitable container means, a poly(A) polymerase, 
ATP, a control RNA, and an optimized tailing buffer. 

15 B. RNA 

There are several types of naturally occurring RNA molecules including 
messenger RNA (mRNA), transfer RNA (tRNA), ribosomal RNA (rRNA), 
microRNA, and siRNA. These molecules performs many different functions in the 
cell. mRNA, which conveys information from the nucleus to the cytoplasm in 

20 eukaryotes, is the most intensely studied. Several molecular biology procedures use 
purified mRNA as starting material or are designed to work primarily with mRNA. 
These procedures include: cDNA synthesis (for library construction, RT-PCR 
analysis, or 5' end analysis through primer extension); Northern blot analysis; 
ribonuclease protection assays; screening procedures involving in vitro translation; 

25 and gene expression analysis with microarrays. 

There are several existmg procedures to purify RNA from various biological 
samples. However, mRNA represents only 1-5% of the mass of total RNA 
(Sambrook, 2001). Of the remainder, the major RNA species is ribosomal RNA 
(rRNA), constituting 80% or more of total RNA mass (Sambrook et al, 1989 and 
30 2001). Although the total RNA isolated from cells can sometimes be used for the 
above-mentioned procedures, usually a preliminary purification of mRNA from total 
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RNA is often preferred, if not required. This is especially true if the particular mRNA 
being sought or targeted is in low abundance (0.5% or less of the mRNA population). 
The presence of rRNA can interfere in the detection of mRNA by Northern blotting, 
RNase protection assays, differential display analysis, and expression profiling by 
5 gene arrays, especially if the target being analyzed is in low abundance. Often, the 
mRNA from scientifically interesting genes falls into this category. 

In eukaryotic cells, mRNAs are polyadenylated at their 3'-termini, unlike 
rRNAs and tRNAs. The poly(A) tails of eukaryotic mRNAs usually consist of about 
200 adenosyl residues and allow them to be purified based on hybridization to 

10 oligo(dT) nucleotides. hi contrast, bacterial mRNAs are not uniformly 
polyadenylated and therefore cannot be purified by oligo(dT) hybridization. The lack 
of an efficient method to purify and/or amplify bacterial mRNA has hampered in vivo 
whole-genome expression analyses. Genome-wide expression analysis of bacterial 
genes is made even more difficult in circumstances where the pool of bacterial mRNA 

15 is limited, such as in host-cell infection or in some environmental samples. 

In addition to bacterial mRNA, the methods and compositions of the present 
invention may be used in conjunction with any non-polyadenylated RNA including, 
for example, siRNA, microRNA, tRNA, rRNA, synthetic RNA, and degraded mRNA. 
Methods and compositions relating to siRNA molecules are described, for example, in 
20 U.S. Applications 10/298,480, 10/360,772, 10/460,775, and 10/355,820, each of 
which in incorporated herein by reference. Methods and compositions relating to 
isolating, manipulating, and using microRNA molecules are described, for example, 
in U.S. Application 10/667,126 and U.S. Application 60/575,743, each of which is 
incorporated herein by reference. 

25 1- Stabilizing RNA 

Obtaining high quality, intact RNA is important for quantitative and 
qualitative analysis of RNA expression. To obtain high quality RNA it is necessary to 
minimize the activity of RNase liberated during cell lysis and to prevent RNA 
degradation from other sources. This is normally accomplished by using isolation 
30 methods that disrupt tissues and inactivate or inhibit RNases simultaneously. 
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For specimens low in endogenous ribonuclease, isolation protocols commonly 
use extraction buffers containing detergents to solubilize membranes, and inhibitors 
of RNase such as placental ribonuclease inhibitor or vanadylribonucleoside 
complexes. RNA isolation from more challenging samples, such as intact tissues or 

5 cells high in endogenous ribonuclease, requires a more aggressive approach. In these 
cases, the tissue or cells are quickly homogenized in a powerful protein denaturant 
(usually guanidinium isothiocyanate), to irreversibly inactivate nucleases and 
solubilize cell membranes. If a tissue sample can not be promptly homogenized, it 
must be rapidly frozen by immersion in liquid nitrogen, and stored at -80°C. Samples 

10 frozen in this manner should not be thawed prior to RNA isolation or the RNA will be 
rapidly degraded by RNase liberated during the cell lysis that occurs during freezing. 

RNA preservation reagents that can protect the RNA in a tissue or cell sample 
from nucleases at temperatures above the freezing point are also know in the art, and 
are described, for example, in U.S. Patents 6,528,641 and 6,204,375, incorporated 
1 5 herein by reference. 

2. Isolation of RNA 

Methods of isolating RNA are known to those of skill in the art, and it is 
contemplated that any RNA isolation or purification scheme known in the art could be 
used in the context of the present invention. For example, Filter-based methods for 
20 the isolation of RNA are also known in the art. One example is Ambion f s 
RNAqueous® Technology. RNAqueous® is a rapid, filter-based RNA isolation 
system that does not require the use of phenol, chloroform or other toxic organic 
chemicals. 

hi addition, commercially available kits such as Ambion's RiboPure™ RNA 
25 Isolation Kit, RiboPure™-Bacteria RNA Isolation Kit, and RiboPure™-Yeast RNA 
Isolation Kit may be used to isolate RNA. Additional methods for isolating RNA are 
described, for example, in U.S. Application 09/854,412, incorporated herein by 
reference. Methods and compositions for isolating RNA from fixed tissues are 
described, for example, in U.S. Application 60/490,325, incorporated herein by 
30 reference. It is also contemplated that the chemical synthesis of RNA in situ can be 
used to prepare RNA. 
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3. Separation of DNA and RNA 

In certain embodiments of the invention, it is desirable to remove DNA from 
RNA samples. DNA contamination can interfere with the quantitative analysis of 
mRNA. For example, DNA contamination can cause false positives in quantitative 
5 RT-PCR. 

Methods of DNA removal from RNA samples are known to those skilled in 
the art. Examples of common methods include DNase digestion, acid 
phenolxhloroform extraction, and LiCl precipitation. 

DNase is an endonuclease that cleaves DNA. It must be inactivated or 
10 removed from the reaction prior to PCR, otherwise, it may digest newly amplified 
DNA. Acid phenolxhloroform (5:1 phenol:CHCl 3 ; pH 4.7) extraction partitions 
DNA in to the organic phase. The RNA remains in the aqueous phase and can be 
subsequently recovered by precipitation. LiCl is a selective precipitant of RNA. It 
inefficiently precipitates DNA, which is discarded in the supernatant. 

1 5 4. Separation of rRNA from mRNA 

Ribosomal RNAs can make up as much as 80% or more of the total RNA in a 
sample. It is often desirable to separate mRNA from rRNA because rRNA can 
adversely affect the quantitative analysis of mRNA. For example, the presence of 
rRNA can increase background in mRNA samples, resulting in variations in mRNA 

20 concentration between samples and decreasing the efficiency of cDNA probe 
synthesis. In eukaryotic organisms, mRNAs have a poly(A) tail at the 3' end. This 
can be used to separate these molecules away from rRNA and other non-mRNA 
species that lack this "polyA tail." Bacterial mRNA, however, are not uniformly 
polyadenylated and consequently cannot be separated from rRNA based on this 

25 property. 

One approach to separating rRNA from mRNA is to deplete the rRNA from 
the sample. One example, is the hybridization of rRNA molecules using 
oligonucleotides homologous to the 17S rRNA, 18S rRNA, or 28S rRNA in the case 
of eukaryotic rRNAs, or to the 16S rRNA or 23S rRNA in the case of bacterial rRNA. 
30 The oligonucleotides are designed such that they can be "captured" and the 



WO 2006/029350 PCT/US2005/032227 

hybridization product removed from the sample. For example, the oligonucleotides 
may be immobilized on a surface such as a column or a bead. MICROBExpress™ 
and MICROBEnrich™ (Ambion, Austin, TX) are examples of commercially 
available kits for the depletion of rRNA. Methods and compositions for the depletion 
5 or rRNA from a sample are described in U.S. Application Serial No. 10/029,397, 
which is incorporated by reference. 

5. Separation of Polyadenylated mRNA from Non- 
Polyadenylated mRNA 

In eukaryotic organisms, one trait shared by all mRNAs is the presence on the 
3' end of a stretch of tens to hundreds of adenine (A) residues. This has been used as 
a means for separating these molecules away from rRNA and other non-mRNA 
species that lack this "polyA tail." The standard protocol for the selection of polyA+ 
RNA is based on the method of Aviv and Leder (1972), wherein a short stretch of 
DNA consisting solely of thymidine (T) residues ('oligo-dT') is affixed to an 
insoluble matrix. The original procedure used oligo-dT covalently linked to cellulose 
poured into a column. This is then used as a selective immobilization matrix for 
mRNA in the sample by setting up conditions that favor formation of RNA-DNA 
double strands. The total RNA sample is applied to the column in an appropriate salt 
buffer (originally 0.5 M KC1 in 10 mM Tris, pH 7.5), encouraging hybridization to the 
polyA stretches found solely at the 3' ends of mRNA. The column is then subjected 
to extensive washing with the application buffer (containing 0.5 M KC1), then a 
lower-ionic-strength solution (0.1 M KC1), followed by elution of mRNA with 10 mM 
Tris (pH 7.5). Subsequent modifications on this original procedure have retained the 
basic process of hybridization to immobilized oligo-dT in approximately 0.5 M salt, 
but have changed the format from columns to batch procedures to allow the procedure 
to be performed faster and have used NaCl or LiCl as the salts. 

Further changes have been the replacement of cellulose with plastic or glass 
beads as the immobilization matrix, some of which are impregnated with ferrous 
material giving them a magnetic quality. This magnetic quality allows such magnetic 
30 " beads, as they are referred to, to be batch isolated on magnetic stands rather than 
requiring gravity or centrifugal force to pellet or filter separate. A further wrinkle in 
the procedure is the use of a biotin-streptavidin linkage in the connection between 
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oligo-dT and bead, where the oligo is biotinylated and the bead is covalently coupled 
to streptavidin. The hybridization can be performed in solution with this procedure, 
linking the oligo-dT-mRNA hybrids to the beads in a subsequent step. 

Bacteria, however, lack the relatively stable poly(A) tails found on eukaryotic 
5 messages. The inventors have exploited this difference between eukaryotic mRNA 
and bacterial mRNA to efficiently separate the two populations of mRNA from mixed 
samples. By selectively hybridizing mRNA with poly(A) tails to immobilized oligo- 
dT molecules, the eukaryotic mRNA can be depleted from a sample while the 
bacterial mRNA remains in the sample. 

10 6. Nucleic Acid Tails 

The present invention provides methods for tailing a targeted RNA molecule 
in a sample with a nucleic acid tail. The nucleic acid tail may be any nucleic acid 
sequence. For example, the nucleic acid tail may be a homopolymeric sequence, such 
as a poly(A), poly(T), poly(G), poly(C), or poly(U) sequence. 

15 Methods for adding nucleic acid sequences to RNA molecules are known to 

those of skill in the art. For example, to add a poly(A) sequence to the 3' end of an 
RNA, one could use a poly(A) polymerase, such as poly(A) polymerase I of E. coli or 
yeast poly(A) polymerase. The present invention provides improvements over prior 
art methods of polyadenylating mRNA molecules by providing conditions that 

20 increase the efficiency of polyadenylation and limit the size of the poly(A) tails. 

Seemingly straightforward and simple, the polyadenylation of complex RNA 
populations is actually a quite complicated reaction. For polyadenylating bacterial 
mRNAs, the inventor used purified poly(A) polymerase I (PAP I) enzyme of E. coli. 
PAP I or PAP adds AMP to 3' ends of RNA molecules using ATP as a donor. The 
25 efficiency of PAP has been found to vary with different substrate RNAs. RNAs with 
3' end nucleotides that are paired, such as in a hairpin, are less efficiently 
polyadenylated (Feng and Cohen, 2000). The enzyme uses an ordered bi-bi kinetic 
mechanism and adds As distributively (Eun, 1996). Initiation and elongation occur at 
different rates, and the enzyme dissociates from the primer after every step. 
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To ensure representative amplification of a population of bacterial mRNAs, it 
is important that essentially all mRNAs in a population of bacterial mRNA are 
poly(A) tailed. However, using standard reaction conditions for commercially 
available PAP enzymes and synthetic RNA transcripts, the inventor found that only 
5 about 90-95% of a transcript could be tailed. 

Another drawback to using the standard reaction conditions for commercially 
available PAP enzymes was that for small mass amounts of RNA, such as those likely 
to be used for amplification (<100 ng), the tail lengths were extremely long - up to 9 
kb in length. Excessive tail-length could inhibit amplification reactions through 

10 several possible mechanisms: (1) reverse transcriptase may dissociate from the 
template during polymerization through extremely long homopolymeric A tracts; (2) 
dTTP may be effectively exhausted during 1 st strand synthesis, slowing the reaction 
rate; (3) excess poly(A) may itself be detrimental to reaction kinetics; (4) UTP may 
become limiting during the in vitro transcription step; and/or (5) T7 polymerase may 

15 be hindered while incorporating long U tracts at the 5' ends of antisense RNAs 
(aRNAs). 

Limiting tail lengths and ensuring that all of the targeted RNA is tailed are 
intricately linked. Achieving one of the goals usually adversely affects the other. To 
optimize this procedure, the inventors altered numerous reaction constituents and 
20 parameters and tested novel reaction components and conditions. 

Two reaction modifications: (1) altering the secondary structure of the RNA 
by either adding a single stranded binding protein (e.g., T4 Gene 32 protein) or 
denaturing the RNA prior to its addition to the PAP reaction; and (2) removing Mn 2+ 
from the "standard" PAP reaction buffer enabled the tailing of essentially 99-100% of 
25 a transcript. This was verified by analyzing reactions on RNA LabChips and by 
quantifying radiolabeled transcripts before and after tailing with a phosphorimager. 
Both methods indicated that <1% of the untailed transcript remains following the PAP 
reaction. 

Limiting the size of the poly(A)-tails appended to mRNAs was accomplished 
30 by drastically reducing the ATP concentration in the reaction to 25-70 p.M for target 
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RNA amounts in the range of 1-100 ng and by reducing the reaction time to 5-15 min 
at a temperature of 37°C. 

The improved method of polyadenylating bacterial mRNA provided by the 
present invention, which ensures that essentially all of the transcript is tailed and that 
5 limits tail lengths, enables genome-wide expression analysis of bacterial genes by 
enabling representative amplification of bacterial mRNA molecules. Moreover, the 
methods and composition of the present invention can be applied to any non- 
polyadenylated RNA. 

7. Amplification of Antisense RNA 

10 In some embodiments, the present invention provides methods and 

compositions for the amplification of targeted RNA molecules. Amplification of 
targeted RNA molecules is desirable when the amount of targeted RNA in a sample is 
too low for microarray analysis or other applications. 

In certain aspects, the target RNA is amplified by the processive synthesis of 
15 multiple RNA molecules from a single cDNA template, which results in amplified, 
antisense RNA (aRNA). Methods for the synthesis of aRNA are described in U.S. 
Patents 5,545,522, 5,716,785, and 5,891,636 all of which are incorporated herein by 
reference. 

In typical embodiments, these methods involve the incorporation of an RNA 
20 polymerase promoter into a cDNA molecule by priming cDNA synthesis with a 
primer complex comprising a synthetic oligonucleotide containing the promoter. 
Following synthesis of double-stranded cDNA, a reverse transcriptase is added, and 
antisense RNA is transcribed from the cDNA template. The amplification, which will 
typically be at least about 500-fold, but may be at least about 1,000-, 5,000-, 10,000-, 
25 15,000-, or 20,000-fold or more, can be achieved from nanogram quantities or less of 
cDNA, and is economical and simple to perform under standard molecular biology 
laboratory conditions. 

One advantage that the processive synthesis of aRNA has over PCR is that 
only one region of shared sequence need be known to synthesize aRNA; PCR 
30 generally requires that shared sequences be known both 5'- and 3'- to the region of 
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interest, and that these flanking regions be sufficiently close to allow efficient 
amplification. 

The synthesis of aRNA is useful for amplifying sequences of interest from 
small amounts of nucleic acid. The aRNA amplification is uniform among individual 
5 sequences, and thus, it is useful in estimating relative levels of representation of a 
given sequence relative to other sequences within a population. Such quantitative 
resolution finds use in a variety of application. Of particular interest is the genome- 
wide expression analysis of bacterial genes. 

For example, aRNA can be amplified from limited amounts of clinical 
10 material to allow pathogen-specific sequences to be identified. In a preferred 
embodiment, the present invention polyadenylates bacterial mRNAs, and then uses an 
amplification primer with a polythymidylate region that recognizes the commonly 
shared poly(A) tail. Preferably the promoter region of the amplification primer is 
derived from the SP6, T3, or T7 phage. The RNA polymerase used for the 
15 transcription must be capable of operably binding to the particular promoter region 
employed in the promoter-primer complex. A preferred RNA polymerase is that 
found in bacteriophages, in particular T3 and T7 phages. 

8, Hybridization 

As used herein, "hybridization", "hybridizes" or "capable of hybridizing" is 
20 understood to mean the forming of a double or triple stranded molecule or a molecule 
with partial double or triple stranded nature. The term "anneal" as used herein is 
synonymous with "hybridize." The teim "hybridization", "hybridize(s)" or "capable 
of hybridizing" encompasses the terms "stringent condition(s)" or "high stringency" and 
the terms "low stringency" or "low stringency condition(s)." 

25 As used herein "stringent condition(s)" or "high stringency" are those 

conditions that allow hybridization between or within one or more nucleic acid 
strand(s) containing complementary sequence(s), but precludes hybridization of 
random sequences. Stringent conditions tolerate little, if any, mismatch between a 
nucleic acid and a target strand. Such conditions are well known to those of ordinary 

30 skill in the art, and are preferred for applications requiring high selectivity. Non- 
limiting applications include isolating a nucleic acid, such as a gene or a nucleic acid 
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segment thereof, or detecting at least one specific mRNA transcript or a nucleic acid 
segment thereof, and the like. 

Stringent conditions may comprise low salt and/or high temperature 
conditions, such as provided by about 0.02 M to about 0.15 M NaCl at temperatures 

5 of about 50°C to about 70°C. Alternatively, stringent conditions may be determined 
largely by temperature in the presence of a TMAC solution with a defined molarity 
such as 3M TMAC. For example, in 3 M TMAC, stringent conditions include the 
following: for complementary nucleic acids with a length of 15 bp, a temperature of 
45 °C to 55 °C; for complementary nucleotides with a length of 27 bases, a 

10 temperature of 65 °C to 75 °C; and, for complementary nucleotides with a length of 
>200 nucleotides, a temperature of 90 °C to 95 °C. It is understood that the 
temperature and ionic strength of a desired stringency are determined in part by the 
length of the particular nucleic acid(s), the length and nucleobase content of the target 
sequence(s), the charge composition of the nucleic acid(s), and to the presence or 

15 concentration of formamide, tetramethylammonium chloride (TMAC), tetraethyl 
ammonium chloride (TEAC), or other salts or solvent(s) in a hybridization mixture. 

It is also understood that these ranges, compositions and conditions for 
hybridization are mentioned by way of non-limiting examples only, and that the 
desired stringency for a particular hybridization reaction is often determined 

20 empirically by comparison to one or more positive or negative controls. Depending 
on the application envisioned it is preferred to employ varying conditions of 
hybridization to achieve varying degrees of selectivity of a nucleic acid towards a 
target sequence, hi a non-limiting example, identification or isolation of a related 
target nucleic acid that does not hybridize to a nucleic acid under stringent conditions 

25 may be achieved by hybridization at low temperature and/or high ionic strength. Such 
conditions are termed "low stringency" or "low stringency conditions," and non- 
limiting examples of low stringency include hybridization performed at about 0.15 M 
to about 0.9 M NaCl at a temperature range of about 20°C to about 50°C. Of course, 
it is within the skill of one in the art to further modify the low or high stringency 

30 conditions to suite a particular application. 
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9, Nucleic Acid Arrays 

Because the present invention provides efficient methods and compositions for 
the purification and global amplification of target RNA molecules from complex 
RNA mixtures or from samples with limited amounts of RNA, it enables the detection 

5 of previously undetectable RNA molecules with nucleic acid arrays. In a particular 
embodiment, the invention provides methods and compositions for the analysis of 
bacterial mRNA expression using nucleic acid arrays. The term a "nucleic acid array" 
refers to a plurality of probe elements, each probe element comprising one or more 
nucleic acid molecules immobilized on one or more solid surfaces to which sample 

1 0 nucleic acids can be hybridized. 

Microarrays are known in the art and consist of a surface to which probes that 
correspond in sequence to gene products (e.g., cDNAs, mRNAs, cRNAs, 
polypeptides, and fragments thereof), can be specifically hybridized or bound at a 
known position. In one embodiment, the microarray is an array (i.e., a matrix) in 

15 which each position represents a discrete binding site for an RNA, and in which 
binding sites are present for products of most or almost all of the genes in the 
organism's genome. In a preferred embodiment, the "binding site" (hereinafter, 
"site"") is a nucleic acid or nucleic acid analogue to which a particular cognate cDNA 
or aRNA can specifically hybridize. The nucleic acid or analogue of the binding site 

20 can be, e.g., a synthetic oligomer, a full-length cDNA, a less-than full length cDNA, 
or a gene fragment. A microarray may contain binding sites for products of all or 
almost ail genes in the target organism's genome, but such comprehensiveness is not 
necessarily required. 

The nucleic acid or analogue is attached to a solid support, which may be 
25 made from glass, plastic {e.g., polypropylene, nylon), polyacrylamide, nitrocellulose, 
or other materials. A preferred method for attaching the nucleic acids to a surface is 
by printing on glass plates, as is described generally by Schena et aL, 1995. See also 
DeRisi et aL, 1996; Shalon et aL, 1996; Schena et aL, 1996. Each of these articles is 
incorporated by reference in its entirety. 

30 Other methods for making microarrays, e.g., by masking (Fodor et aL, 1991; 

Maskos and Southern, 1992), may also be used. In principal, any type of array, for 

an 
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example, dot blots on a nylon hybridization membrane (see Sambrook et al, 1989, 
which is incorporated in its entirety for all purposes), could be used, although, as will 
be recognized by those of skill in the art, very small arrays will be preferred because 
hybridization volumes will be smaller. 

Labeled cDNA is prepared from mRNA by oligo dT-primed or random- 
primed reverse transcription, both of which are well known in the art. Reverse 
transcription may be carried out in the presence of a dNTP conjugated to a detectable 
label, most preferably a fluorescently labeled dNTP. Alternatively, isolated mRNA 
can be converted to labeled antisense RNA synthesized by in vitro transcription of 
double-stranded cDNA in the presence of labeled dNTPs (Lockhart et al, 1996, 
which is incorporated by reference in its entirety for all purposes). In alternative 
embodiments, the cDNA or aRNA probe can be synthesized in the absence of 
detectable label and may be labeled subsequently, e.g., by incorporating biotinylated 
dNTPs or rNTP, or some similar means {e.g., photo-cross-linking a psoralen 
derivative of biotin to RNAs), followed by addition of labeled streptavidin {e.g., 
phycoerythrin-conjugated streptavidin) or the equivalent. Alternatively, cDNA or 
aRNA can be labeled indirectly by incorporation of 5-(3-aminoallyl) dNTPs or rNTPs 
to provide a amine reactive group for subsequent addition of label with any moiety 
bearing an N-Hydroxysuccinimide (NHS) ester. 

20 Fluorescently labeled probes can be used, including suitable fluorophores such 

as fluorescein, lissamine, phycoerythrin, rhodamine (Perkin Elmer Cetus), Cy2, Cy3, 
Cy3.5, Cy5, Cy5.5, Cy7, FluorX (Amersham) and others (see, e.g., Kricka, 1992). It 
will be appreciated that pairs of fluorophores are chosen that have distinct emission 
spectra so that they can be easily distinguished. In another embodiment, a label other 

25 than a fluorescent label is used. For example, a radioactive label, or a pair of 
radioactive labels with distinct emission spectra, can be used (see Zhao et al, 1995; 
Pietue* al, 1996). 

The analysis of microarray data may be accomplished using methods of 
statistical analysis known to those skilled in the art. For example, clustering analysis 
30 is commonly used for interpretation of microarray data. It provides both a visual 
representation of complex data and a method for measuring similarity between 
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experiments. Some widely used methods for clustering microarray data include: 
hierarchical, K-means, and self-organizing map. 

C. KITS 

Any of the compositions described herein may be comprised in a kit. In a 
5 non-limiting example the kit, in suitable container means, comprises: a poly(A) 
polymerase; ATP; an optimized tailing buffer; and a control RNA. In certain 
embodiments, the kit further comprises one or more of: an RNA polymerase; a single 
strand binding protein; ethylenediaminetetraacetic acid (EDTA); a reverse 
transcriptase; a first strand buffer; a promoter-oligo-dT primer; a dNTP mix; a 

10 ribonuclease inhibitor; a second strand buffer; a DNA polymerase; RNase H; nuclease 
free water; an RNA polymerase reaction buffer; ATP; CTP; GTP; UTP; TTP; DNase 
I; a cDNA binding buffer; a cDNA wash buffer; an aRNA binding buffer; an aRNA 
wash buffer; an elution solution; an aRNA filter cartridge; a cDNA filter cartridge; or 
collection tubes. In certain embodiments the kit may also include, an rRNA depleting 

15 agent, a DNA depleting agent, labeling agents, or components for isolating poly(A) 
itlRNA. 

In some embodiments the kit includes one or more of the following, but is not 
limited to these components or their precise compositions: a control RNA {e.g., E. 
coli total RNA); EDTA; poly(a) polymerase (e.g., E. coli PAP); 10X optimized tailing 

20 buffer (500 mM Tris-HCl, pH 8.3; 750 mM KC1; 30 mM MgCl 2 ; 100 mM DTT); 
ATP; reverse transcriptase (e.g., AMV-RT, MMLV, Arrayscript MMLV, or other 
mutant reverse transcriptase); 10X first strand buffer (500 mM Tris-HCl, pH 8.0; 750 
mM KC1; 30 mM MgCl 2 ; 100 mM DTT); first strand primer (e.g., a T7-oligo-d(T) 
primer); dNTP mix (5 mM dATP, 5 mM dCTP, 5 mM dGTP, 5 mM dTTP in 10 mM 

25 Tris, pH 8.0); ribonuclease inhibitor (e.g., placental ribonuclease inhibitor); 10X 
second strand buffer (150 mM Tris, pH 7.0; 850 mM KC1; 50 mM MgCl 2 ; 50 mM 
(NH 4 ) 2 S0 4 ); DNA polymerase (e.g., E. coli DNA polymerase or T4 DNA 
polymerase); RNase H; nuclease free water; T7 enzyme mix (T7 RNA polymerase, 
RNase inhibitor protein (RIP), inorganic pyrophosphatase (IPP), SUPERaseln™); 

30 10X T7 reaction buffer (400 mM Tris, pH 8.0; 240-260 mM MgCl 2 ; 20 mM 
spermidine; 100 mM DTT); CTP; GTP; UTP; DNase I; cDNA binding buffer (25 mM 
Tris, pH 7.0; 5.5 M guanidium thiocyanate); cDNA wash buffer (80% ethanol); 
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aRNA binding buffer (4 M GITC; 0.5% N-lauryl sarcosine; 25 mM Na citrate, pH 
7.2; 0.1 M 2-mercaptoethanol); aRNA wash buffer (80% ethanol); elution solution 
(0.1 mM EDTA, pH 8.0); aRNA filter cartridges; 2 ml tubes; and cDNA filter 
cartridges. It is contemplated that any of these components can be included in a kit 
5 with any of the other components. 

The kits may comprise suitably aliquoted nucleic acid compositions of the 
present invention, whether labeled or unlabeled, as may be used to isolate, deplete, or 
separate a targeted nucleic acid. The components of the kits may be packaged either 
hi aqueous media or in lyophilized form. The container means of the kits will 

10 generally include at least one vial, test tube, flask, bottle, syringe or other container 
means, into which a component may be placed, and preferably, suitably aliquoted. 
Where there is more than one component in the kit, the kit also will generally contain 
a second, third or other additional container into which the additional components 
may be separately placed. However, various combinations of components may be 

15 comprised in a vial. The kits of the present invention also will typically include a 
means for containing the nucleic acids, and any other reagent containers in close 
confinement for commercial sale. Such containers may include cardboard containers 
or injection or blow-molded plastic containers into which the desired vials are 
retained. 

20 When the components of the kit are provided in one or more liquid solutions, 

the liquid solution is an aqueous solution, with a sterile aqueous solution being 
preferred. 

However, the components of the kit may be provided as dried powder(s). 
When reagents and/or components are provided as a dry powder, the powder can be 
25 reconstituted by the addition of a suitable solvent. It is envisioned that the solvent 
may also be provided in another container means. 

The container means will generally include at least one vial, test tube, flask, 
bottle, syringe and/or other container means, into which the nucleic acid formulations 
are placed, preferably, suitably allocated. The kits may also comprise a second 
30 container means for containing a buffer and/or other diluent. 
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The kits of the present invention will also typically include a means for 
containing the vials in close confinement for commercial sale, such as, e.g., cardboard 
boxes or injection and/or blow-molded plastic containers into which the desired vials 
are retained. 

Such kits may also include components that facilitate isolation of nucleic 
acids, such as filters, beads, or a magnetic stand. Such kits generally will comprise, in 
suitable means, distinct containers for each individual reagent or solution as well as 
for the targeting agent. 

A kit will also include instructions for employing the kit components as well 
the use of any other reagent not included in the kit. Instructions may include 
variations that can be implemented. 

D, EXAMPLES 

The following examples are included to demonstrate preferred embodiments 
of the invention. It should be appreciated by those of skill in the art that the 
techniques disclosed in the examples which follow represent techniques discovered by 
the inventor to function well in the practice of the invention, and thus can be 
considered to constitute preferred modes for its practice. However, those of skill in 
the art should, in light of the present disclosure, appreciate that many changes can be 
made in the specific embodiments which are disclosed and still obtain a like or similar 
result without departing from the spirit and scope of the invention. 

EXAMPLE 1 
Optimized Polyadenylation of Bacterial mRNA 

25 Seemingly straightforward and simple, the polyadenylation of complex RNA 

populations is actually a quite complicated reaction. For polyadenylating bacterial 
mRNAs, the inventors used purified poly(A) polymerase I (PAP I) enzyme of E. coll 
PAP I or PAP adds AMP to 3' ends of RNA molecules using ATP as a donor. The 
efficiency of PAP has been found to vary with different substrate RNAs. RNAs with 

30 3' end nucleotides that are paired, such as in a hairpin, are less efficiently 
polyadenylated (Feng and Cohen, 2000). The enzyme uses an ordered bi-bi kinetic 
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mechanism and adds adenines distributively (Eun, 1996). Initiation and elongation 
occur at different rates, and the enzyme dissociates from the primer after every step. 

The initial goal was to ensure that all mRNAs in a population of bacterial 
mRNA were poly(A) tailed, thereby ensuring that they will be templates for the 1 st 

5 strand cDNA synthesis reaction - the first step in amplification. Using the standard 
reaction conditions for commercially available PAP enzymes and synthetic RNA 
transcripts, the inventor found that -90-95% of a transcript could be tailed. It was 
also observed that with small mass amounts of RNA, such as those likely to be used 
for amplification (<100 ng), the tail lengths were extremely long - up to 9 kb in 

10 length. The inventors surmised that excessive tail-length could inhibit amplification 
reactions through several possible mechanisms: (1) reverse transcriptase may 
dissociate from the template during polymerization through extremely long 
homopolymeric A tracts; (2) dTTP may be effectively exhausted during 1 st strand 
synthesis, slowing the reaction rate; (3) excess poly(A) may itself be detrimental to 

15 reaction kinetics; (4) UTP may become limiting during the in vitro transcription step; 
and/or (5) T7 polymerase may be hindered while incoiporating long U tracts at the 5' 
ends of antisense RNAs (aRNAs). 

To ensure more complete mRNA polyadenylation, and to limit tail lengths to a 
reasonable size (preferably <500 base pairs), new methodologies were developed. 

20 To effectively monitor polyadenylation of mRNAs, several synthetic 

transcripts of various sizes (300 - 1500 nt) were individually evaluated, as even 
simple mixtures of a few transcripts were cumbersome to evaluate. Limiting tail 
lengths and ensuring that all of the transcript is tailed are intricately linked. 
Achieving one of the goals usually adversely affected the other. To optimize this 

25 procedure, the inventors altered numerous reaction constituents and parameters and 
tested novel reaction components and conditions. 

The inventors first modified the reaction time (0-30 min), ATP amount (0-1 
mM), and U of PAP (0-2 U) in the reaction. It was generally observed that longer 
reaction times, higher ATP concentration, and more PAP increased the percentage of 
30 the transcript that was tailed, but also increased the lengths of A tails. Under 
conditions where tail lengths were acceptably reduced, too much transcript was left 
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untailed. The inventors also tested the chain-terminating nucleotides cordycepin and 
ddATP (0-50% exchange with ATP or with ATP held constant) and observed the 
same effect. 

Tailing was also evaluated using thio-ATP, dATP, CTP, GTP, and UTP. GTP 
5 was not a substrate. CTP and UTP effectively limited tail lengths, leaving some 
untailed transcript. UTP would be a more useful substitute than CTP, because 
chemically synthesizing a poly(G) oligonucleotide (for subsequent priming of 1 st 
strand synthesis on C-tailed RNA) is problematic. None of these modifications 
appeared to be superior to using ATP. 

10 Two reaction modifications: (1) adding a single stranded binding protein (e.g-., 

T4 Gene32 protein); and (2) denaturing RNAs prior to their addition to the PAP 
reaction enabled the tailing of essentially 99-100% of a transcript. This was verified 
by analyzing reactions on RNA LabChips and by quantifying radiolabeled transcripts 
before and after tailing with a phosphorimager. Both methods indicated that <1% of 

1 5 the untailed transcript remains following the PAP reaction. 

Limiting the size of A-tails appended to mRNAs was accomplished by 
drastically reducing the ATP concentration in the reaction to 25-50 |iM and by 
reducing the reaction time to 5-15 min at 37°C. FIG. 1 shows an Agilent Bioanalyzer 
electropherogram depicting a synthetic GAPDH transcript before and after tailing. In 
20 this example, the untailed transcript is -315 nucleotides, and the majority of the tailed 
transcript is -570 nucleotides. However, some transcripts are tailed to greater 
lengths. 



EXAMPLE 2 

25 Effect of MnCl 2 Concentration on Polyadenylation Reaction 

This experiment was designed to test whether or not the addition of MnCk has 
an effect on poly(A) tailing efficiency and tail length. The reaction utilized 100 ng of 
a synthetic in vitro transcribed RNA of 387 nucleotides as template. The reactions 
were performed as follows: 
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RNA template (100 ng) and H 2 0 to a final volume of 10 \xl were combined in a 
0,5 ml tube. The RNA solution was heat denatured at 70°C for 10 minutes and then 
quenched on ice for 3 minutes. The other reactants were then added as a 15 \xl 
cocktail containing: 5 jul 5X Polyadenylation buffer, 1.25 |il 1 mM ATP, 1 \x\ 

5 Ribonuclease inhibitor (40 U/jli1) 9 1 \il E. coli Poly(A) Polymerase (2 U/^il) and 4.25 
|al H2O. Depending on the reaction (with MnCl 2 or without MnCl 2 ) either 2.5 |il 25 
mM MnCl 2 or 2.5 (il H 2 0 was also added. The reactions were allowed to incubate at 
37°C for various time points. At the end of this incubation 2 jllI of 0.5 M EDTA was 
added to stop the reactions. The negative control had EDTA added prior to the 

10 reactant cocktail, thus preventing tailing. Finally the reactions were purified with a 
variation of Ambion's MegaClear glass-fiber based RNA purification kit. 

Shown in FIG. 2 is an Agilent Bioanalyzer electropherograms depicting the 
extent of polyadenylation for various reaction conditions. All of the polyadenylation 
reaction was run on the Bioanalyzer. As can be seen in FIG. 2, MnCl 2 had a dramatic 
15 effect on tail length. When the reaction was performed with MnCl 2 [2.5 mM] tail 
length extended from -0.5 Kb to beyond 6 Kb. When MnCl 2 was not added to the 
reaction, tail length was reduced dramatically, to a median size of -150 nt. While 
incubation time had some effect, it was small relative to the MnC12 effect. 

20 EXAMPLE 3 

Bacterial RNA Amplification 

In this experiment total E. coli RNA was polyadenylated and amplified. Two 
RNA preparations were used (lot # PR043921 and the same RNA after purification 
over a glass fiber filter column) for amplification and two in vitro transcription (IVT) 
25 incubation times were compared (6 hours and 14 hours). 

Total RNA template (10 ng or 100 ng) and H 2 0 to a final volume of 5 \xl were 
combined in a 0.5 ml tube. The RNA solution was heat denatured at 70°C for 10 
minutes and then quenched on ice for at 3 minutes. The other reactants were then 
added as a 5 |il cocktail containing: IX Polyadenylation buffer (50 mM Tris-HCl pH 
30 8.3, 75 mM KC1, 3 mM MgCl 2 and 10 mM DTT), 50 ^M ATP, 40 Units 
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Ribonuclease inhibitor Protein, 2 Units E. coli Poly(A) Polymerase. The reaction was 
incubated for 15 minutes at 37°C and placed on ice until the amplification step. 

Ambion's MessageAmp II RNA amplification kit was used to amplify the 
poly adenylated "bacterial RNA according to the manufacturer's instructions, with only 
5 minor changes. An anchored T7(dT)VN primer was used, and there was no 
denaturation step of the RNA/primer prior to reverse transcription. In vitro 
transcription was performed for either 6 or 14 hours. Table 1 below shows the aRNA 
yields and mean and the coefficient of variation expressed in percent (% CV) for 
triplicate reactions. 



Yield 

(va) 


Triplicate Mean 


%CV 


Description 


137.5245 
123.726 
128.604 


130.0 


5.38 


100ng total RNA-organic isolation, 14 hour IVT 


80.0175 
75.465 
78.387 


78.0 


2.96 


10ng total RNA-organic isolation, 14 hour IVT 


206.907 
190.065 
198.51 


198.5 


4.24 


100ng total RNA-glass fiber purified, 14 hour IVT 


68.6355 
72.537 
58.1715 


66.4 


11.18 


10ng total RNA-glass fiber purified, 14 hour IVT 










61.9035 
57.2445 
51.444 


56.9 


9.22 


100ng total RNA-organic isolation, 6 hour IVT 


33.3465 
36.6825 
30.8265 


33.6 


8.74 


10ng total RNA-organic isolation, 6 hour IVT 


108.4545 
110.667 
114.114 


111.1 


2.57 


100ng total RNA-glass fiber purified, 6 hour IVT 


22.0875 
31.8465 
39.987 


31.3 


28.63 


10ng total RNA-glass fiber purified, 6 hour IVT 



All of the compositions and methods disclosed and claimed herein can be 
made and executed without undue experimentation in light of the present disclosure. 
1 5 While the compositions and methods of this invention have been described in terms of 
preferred embodiments, it will be apparent to those of skill in the art that variations 
may be applied to the compositions and methods and in the steps or in the sequence of 
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steps of the methods described herein without departing from the concept, spirit and 
scope of the invention. More specifically, it will be apparent that certain agents which 
are both chemically and physiologically related may be substituted for the agents 
described herein while the same or similar results would be achieved. All such 
5 similar substitutes and modifications apparent to those skilled in the art are deemed to 
be within the spirit, scope and concept of the invention as defined by the appended 
claims. 
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CLAIMS : 

1 . A method for increasing the efficiency of tailing a targeted RNA and/or 
5 limiting the length of a tail added to the targeted RNA in a sample comprising, 

altering the secondary structure of the targeted RNA and incubating the targeted RNA 
in the presence of a tailing enzyme and a nucleotide under conditions that allow 
tailing of the targeted RNA. 

10 2. The method of claim 1 wherein the targeted RNA, the tailing enzyme, and the 
nucleotide, are incubated in the presence of substantially no Mn 2+ under conditions 
whereby the tail added to the majority of the targeted RNA molecules is about 20 
nucleotides to about 500 nucleotides long, 

15 3 . The method of claim 1 or claim 2, wherein the nucleotide is provided in a 
reduced amount. 

4. The method of claim 3, wherein the reduced amount of the nucleotide is 
between about 25-70 for 1-100 ng of the targeted RNA. 

20 

5. The method of any of claims 1 to 4, wherein the incubating is at about 37° C 
for between about 5 to 15 minutes. 

6. The method of any of claims 1 to 6, wherein altering the secondary structure 
25 of the targeted RNA comprises denaturing the targeted RNA. 

7. The method of claim 6, wherein denaturing the targeted RNA is further 
defined as heating the targeted RNA for at least about 10 minutes at a temperature of 
at least about 70°C. 

30 

8. The method of any of claims 1 to 6, wherein altering the secondary structure 
of the targeted RNA comprises adding a single strand binding protein. 
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9. The method of claim 8, wherein the single stranded binding protein is T4 Gene 
32 protein. 

1 0. The method of any of claims 1 to 8, wherein the tailing enzyme is a poly(A) 
5 polymerase. 

1 1 . The method of claim 1 0, wherein the poly(A) polymerase is poly(A) 
polymerase I of E. coli. 

10 12. The method of claim 1 0, wherein the poly(A) polymerase is yeast poly(A) 
polymerase. 

13. The method of any of claims 1 to 12, wherein the nucleotide is ATP, GTP, 
TTP, UTP, or CTP. 

15 

14. The method of any of claims 1 to 13, wherein the sample comprises a mixture 
of eulcaryotic RNA and prokaryotic RNA. 

15. The method of any of claims 1 to 15, wherein the targeted RNA is a non- 
20 polyadenylated mRNA; an miRNA, an siRNA, or a synthetic RNA. 

16. The method of claim 15, wherein the non-polyadenylated mRNA is a bacterial 
mRNA. 

25 17. The method of claim 15, wherein the non-polyadenylated mRNA is a 
degraded mRNA. 

1 8. The method of claim 15, wherein the miRNA is obtained from a cell, cell 
culture, a body fluid, a tissue, or an organ. 

30 

19. The method of claim 15, wherein the siRNA is obtained from a cell, cell 
culture, a body fluid, a tissue, or an organ. 
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20. The method of any of claims 1 to 19, wherein the sample is further defined as 
obtained from a cell, a cell culture, a body fluid, a tissue, or an organ. 

21 . The method of any of claims 1 8 to 20, wherein the tissue is a fixed tissue or a 
frozen tissue. 

22. The method of claim 21, wherein the fixed tissue is a formalin-fixed paraffin 
embedded tissue. 

23. The method of any of claims 1 to 20, wherein the sample is an environmental 
sample. 

24. The method of claim 23, wherein the environmental sample is a soil sample, a 
water sample, or an air sample. 

25. The method of any of claims 1 to 24, further comprising depleting 
polyadenylated mRNA from the sample prior to tailing the targeted RNA. 

26. The method of claim 25, wherein depleting the polyadenylated mRNA 
comprises hybridizing the polyadenylated mRNA to a poly(T) oligonucleotide. 

27. The method of claim 26, wherein the poly(T) oligonucleotide is attached to a 
surface. 

28. The method of claim 27, wherein the surface is a column or a bead. 

29. The method of claim 28, wherein the bead is a magnetic bead. 

30. The method of any of claims 1 to 29, further comprising depleting rRNA from 
the sample. 

3 1 . The method of claim 30, wherein the rRNA is eukaryotic rRNA. 

32. The method of claim 30, wherein the rRNA is bacterial rRNA. 
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33. The method of any of claim 1 to 32, wherein at least about 95% of the targeted 
RNAis tailed. 

5 34. The method of claim 33, wherein at least about 99% of the targeted RNA is 
tailed. 

35. The method of any of claims 1 to 34, wherein the tail on the majority of the 
targeted RNA molecules comprises about 100 nucleotides to about 300 nucleotides. 

10 

36. The method of any of claims 1 to 35, further comprising amplifying the 
targeted RNA. 

37. The method of claim 36, wherein amplifying the targeted RNA comprises: 

(a) hybridizing the targeted RNA with a promoter-oligo-dN primer; 

(b) extending the promoter-oligo-dN primer using a reverse transcriptase 
to form a first strand complementary to the targeted RNA; 

(c) synthesizing a second strand complementary to the first strand; and 

(d) transcribing copies of RNA initiated from the promoter-oligo-dN 
primer using an RNA polymerase, wherein the RNA is complementary 
to the second strand. 

3 8 . The method of claim 3 7, wherein transcribing involves a T bacteriophage 
RNA polymerase. 
25 

39. The method of claim 38, wherein the T bacteriophage RNA polymerase is T7 
RNA polymerase or T3 RNA polymerase. 

40. The method of any of claims 37 to 39, wherein the promoter-oligo-dN primer 
30 isaT7-oligo~dT. 

41 . The method of claim 40, wherein the T7-oligo-dT is T7-dT 24 VN. 



15 



20 
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42. The method of any of claims 37 to 41 , wherein the reverse transcriptase is 
MMLV reverse transcriptase. 

43. The method of claim 42, wherein the MMLV reverse transcriptase is a mutant 
5 MMLV reverse transcriptase. 

44. The method of any of claims 1 to 43 further comprising using the targeted 
RNA for expression analysis. 

10 45 . The method of claim 44, wherein the expression analysis comprises 
hybridizing the targeted RNA to a microarray. 

46. A kit for the practice of a method of any of claims 1 to 46, in suitable 
container means, comprising: 
15 (a) ATP; 

(b) a poly(A) polymerase; 

(c) an optimized tailing buffer; and 

(d) a control RNA. 

20 47. The kit of claim 46, wherein the poly(A) polymerase is poly(A) polymerase I 
of E. coli. 



48. The kit of claim 46, wherein the poly(A) polymerase is yeast poly(A) 
polymerase. 

25 

49. The kit of any of claims 46 to 48 further comprising one or more of a 
promoter-oligo-dT primer, a reverse transcriptase, an RNase inhibitor, a dNTP mix, 
RNase H, an RNA polymerase, DNase I, a DNA polymerase, or a single strand 
binding protein. 

30 

50. The kit of claim 49, wherein the single strand binding protein is T4 Gene 32. 

5 1 . The kit of claim 49, wherein the promoter-oligo-dT primer promotes 
transcription by a T bacteriophage RNA polymerase. 
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52. The kit of claim 47, wherein the promoter-oligo-dT primer is an SP6-oligo-dT 
primer. 

5 53. The kit of claim 51, wherein the promoter-oligo-dT primer is a T7-oligo-dT 
primer. 

54. The kit of claim 53, wherein the T7-oligo-dT primer is T7-dT 24 VN. 

10 55. The kit of claim 49, wherein the RNA polymerase is T7 RNA polymerase or 
T3 RNA polymerase. 



15 



20 



56. The kit of claim 49, wherein the reverse transcriptase is MMLV reverse 
transcriptase. 

57. The kit of claim 56, wherein the MMLV reverse transcriptase is a mutant 
MMLV reverse transcriptase. 

58. The kit of claim 49, wherein the RNA polymerase is SP6 RNA polymerase. 



59. A method for amplifying targeted RNA comprising: 

(a) denaturing the targeted RNA; 

(b) incubating the targeted RNA in the presence of a poly(A) polymerase, 
a nucleotide, and an optimized tailing buffer under conditions that 

25 allow tailing of the targeted RNA; and 

(c) amplifying the tailed, targeted RNA. 



60. The method of claim 59, wherein denaturing the targeted RNA comprises 
heating the targeted RNA for at least about 10 minutes at a temperature of at least 
30 about 70°C. 



61 . The method of claim 59, wherein the poly(A) polymerase is poly(A) 
polymerase I of £. coli. 
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62. The method of claim 59, wherein the poly(A) polymerase is yeast poly(A) 
polymerase. 

63. The method of claim 59, wherein the nucleotide is ATP, GTP, TTP, UTP, or 
5 CTP. 

64. The method of claim 59, wherein the tailed, targeted RNA is a poly(A) tailed, 
targeted RNA. 



10 65 . The method of claim 64, wherein amplifying the poly(A) tailed, targeted RNA 
comprises: 

(a) hybridizing the poly(A) tailed, targeted RNA with a T7-oligo-dT 
primer; 

(b) extending the T7-oligo-dT primer using a reverse transcriptase to form 
15 a first strand complementary to the poly(A) tailed, targeted RNA; 

(c) synthesizing a second strand complementary to the first strand; and 

(d) transcribing copies of RNA initiated from the T7-oligo-dT primer 
using a T7 RNA polymerase, wherein the RNA is complementary to 
the second strand. 

20 

66. The method of claim 59 further comprising using the amplified RNA for 
expression analysis. 

67. The method of claim 66, wherein the expression analysis comprises 
25 hybridizing the amplified RNA to a microarray. 

68. A method for amplifying bacterial mRNA from a sample comprising bacterial 
cells and eukaryotic cells comprising: 

(a) isolating RNA from the sample; 

30 (b) depleting polyadenylated mRNA from the isolated RNA; 

(c) depleting rRNA from the isolated RNA; 

(d) tagging the bacterial mRNA; and 

(e) amplifying the bacterial mRNA. 
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69. A method for analyzing bacterial mRNA expression in a sample comprising 
bacterial RNA and eukaryotic RNA comprising: 

(a) isolating the bacterial RNA and the eukaryotic RNA from the sample 
to form an RNA composition; 
5 (c) depleting polyadenylated mRNA and rRNA from the RNA 

composition resulting in a relative enrichment of bacterial mRNA in 
the RNA composition; 

(d) polyadenylating the bacterial mRNA; and 

(e) amplifying the bacterial mRNA, 

10 wherein the amplified bacterial mRNA is used for expression analysis. 

70. A method for amplifying targeted RNA comprising: 

(a) denaturing the targeted RNA; 

(b) incubating the targeted RNA in the presence of a poly(A) polymerase 
1 5 and a reduced amount of a nucleotide under conditions that allow 

tailing of the targeted RNA; and 

(c) amplifying the tailed, targeted RNA. 

71. A method for amplifying targeted RNA comprising: 

20 (a) incubating the targeted RNA in the presence of a poly(A) polymerase, 

a reduced amount of a nucleotide, and an optimized tailing buffer 
under conditions that allow tailing of the targeted RNA; and 
(c) amplifying the tailed, targeted RNA. 

25 
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